o 



oo 

(N 



Novel Architectures and Algorithms for Delay 
Reduction in Back-pressure Scheduling and 

Routing 



Loc Bui, R. Srikant, Alexander Stolyar 



Abstract 

The back-pressure algorithm is a well-known throughput-optimal algorithm. However, its delay 
performance may be quite poor even when the traffic load is not close to network capacity due to 
the following two reasons. First, each node has to maintain a separate queue for each commodity in the 
O . network, and only one queue is served at a time. Second, the back -pressure routing algorithm may route 

some packets along very long routes. In this paper, we present solutions to address both of the above 
issues, and hence, improve the delay performance of the back-pressure algorithm. One of the suggested 
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solutions also decreases the complexity of the queueing data structures to be maintained at each node. 



I. Introduction 

O " 

^ | Resource allocation in wireless networks is complicated due to the shared nature of wireless medium. 

One particular allocation algorithm called the back-pressure algorithm which encompasses several layers 
of the protocol stack from MAC to routing was proposed by Tassiulas and Ephremides, in their seminal 
paper [1]. The back-pressure algorithm was shown to be throughput-optimal, i.e., it can support any arrival 
rate vector which is supportable by any other resource allocation algorithm. Recently, it was shown that 
the back-pressure algorithm can be combined with congestion control to fairly allocate resources among 
competing users in a wireless network O-Q, thus providing a complete resource allocation solution 
from the transport layer to the MAC layer. While such a combined algorithm can be used to perform a 
large variety of resource allocation tasks, in this paper, we will concentrate on its application to scheduling 
and routing. 
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Even though the back-pressure algorithm delivers maximum throughput by adapting itself to network 
conditions, there are several issues that have to be addressed before it can be widely deployed in practice. 
As stated in the original paper 0], the back-pressure algorithm requires centralized information and 
computation, and its computational complexity is too prohibitive for practice. Much progress has been 
made recently in easing the computational complexity and deriving decentralized heuristics. We refer the 
interested reader to (H, Q and references within for some recent results along these lines. We do not 
consider complexity or decentralization issues in this paper; our proposed solutions can be approximated 
well by the solutions suggested in the above papers. 

Besides complexity and decentralization issues which have received much attention recently, the back- 
pressure algorithm can also have poor delay performance. To understand that, we consider two different 
network scenarios: one in which the back-pressure algorithm is used to adaptively select a route for each 
packet, and the other in which a flow's route is chosen upon arrival by some standard multi-hop wireless 
network routing algorithm such as DSR or AODV and the back-pressure algorithm is simply used to 
schedule packets. We refer to the first case as adaptive-routing and the second case as fixed-routing, 
respectively. 

We first discuss networks with fixed routing. The back-pressure algorithm assigns a weight to each 
flow on each link. The weight is equal to the flow's queue backlog at the transmitter of the link minus 
the flow's queue backlog at the receiver. The weight of a link is equal to the maximum weight of any 
flow that uses the link. The back-pressure algorithm then selects a schedule which maximizes the sum of 
the weights of the links included in the schedule. Under such an algorithm, for a link to be scheduled, 
its weight should be slightly larger than zero. Now, let us consider a flow that traverses K links, and use 
an informal argument to show why it is very intuitive that the flow's total queue accumulation along its 
route should grow quadratically with the route length. The queue length at the destination for this flow 
is equal to zero. The queue length at the first upstream node from the destination will be some positive 
number, say, e. The queue length at the second upstream node from the destination will be even larger 
and for the purposes of obtaining insight, let us say that it is 2e. Continuing this reasoning further, the 
total queue length for the flow will be e(l + 2 + . . . + K) = Q(K 2 ). Thus, the total backlog on a path 
is intuitively expected to grow quadratically in the number of hops. On the other hand, suppose a fixed 
service rate is allocated to each flow on each link on its path, then the queue length at each hop will 
be roughly 0(1) depending on the utilization at that link. With such a fixed service rate allocation, the 
total end-to-end backlog should then grow linearly in the number of hops. However, such an allocation 
is possible only if the packet arrival rate generated by each flow is known to the network a priori. One 
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of the contributions of this paper is to use counters called shadow queues introduced in iflOl to allocate 
service rates to each flow on each link in an adaptive fashion without knowing the set of packet arrival 
rates. 

We will also show that the concept of shadow queues can reduce the number of real queues maintained 
at each node significantly. In particular, we will show that it is sufficient to maintain per-neighbor queues at 
each node, instead of per-flow queues required by the back-pressure algorithm in the case of fixed routing. 
In large networks, the number of flows is typically much larger compared to the number of neighbors 
of each node, thus using per-neighbor queues can result in significant reduction in implementation 
complexity. A different idea to reduce the number of queues at each node has been proposed in ifTTTl . but 
the implementation using shadow queues has the additional benefit of delay reduction. 

Next, we discuss the case of adaptive routing. The back-pressure algorithm tends to explore many 
routes to find sufficient capacity in the network to accommodate the offered traffic. Since the goal of 
the algorithm is to maximize throughput, without considering Quality of Service (QoS), back-pressure 
based adaptive routing can result in very long paths leading to unnecessarily excessive delays. In this 
paper, we propose a modification to the back-pressure algorithm which forces it to first explore short 
paths and only use long paths when they are really needed to accommodate the offered traffic. Thus, 
under our proposed modification, the back-pressure algorithm continues to be throughput-optimal, but it 
pays attention to the delay performance of the network. We also refer the reader to a related work in 
lfl2l where the authors use the same cost function as us, but their formulation is different and hence their 
solution is also different. 

II. System Model 

Let us consider a network modeled by a graph, Q = (J\f, C) , where Af is the set of nodes and C is the 
set of links. We assume that time is slotted, with a typical time slot denoted by t. If a link (re, m) is in 
C, then it is possible to transmit packets from node re to node m subject to the interference constraints 
which will be described shortly. 

We let T be the set of flows that share the network resources. Packets of each flow enter the network at 
one node, travel along multiple hops (which may or may not pre-determined), and then exit the network 
at another node. For each flow / G J 7 , let b(f) denote the begin (entering) node, and e(f) denote the 
end (exiting) node of flow /. 

We define a valid schedule tt = fcfjcj, . . . > C |£N to be a set of link rates (measured in terms of 
number of packets) that can be simultaneously supported. Note that due to the interference between 
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links, for each ir, some cf could be zero. Moreover, we make a natural and nonrestrictive assumption 
that if 7r is a valid schedule, then if we replace any subset of its components by zeros, the modified 
schedule is valid as well. We also assume that cf is upper-bounded by some c max for any it and /. Let 
r be the set of all possible valid schedules, and co(T) denote the convex hull of T. 

If the routes of flows are not predetermined, i.e., when adaptive routing is used, then the capacity 
region A of the network is defined as the set of all flow rates which are supportable by the network. 
Tassiulas and Ephremides |1] have shown that A = {Aj}^ e ^. G A if 

. there exists a \i = {^nm} (n, m )eC G co(T), 

• for any link (n, m) 6 C, there exists some allocation |^nm| such that p nm = J2feT^ m > 
and 

• for any node n E J\f, for all flows f £ J 7 , n ^ e(/), 

X f I {n=b(f)} + J2^kn= Yl Vim- 

(fc,n) (n,m) 

The term pj nm can be interpreted as the long-term average rate that link (n, m) allocates to serve flow 
/. Note that the equation in the third bullet above is simply the law of flow conservation. 

Now, if the routes of flows are predetermined, i.e., when fixed routing is used, then for each / G J 7 , 
let L(f) denote the set of links forming the route of /. The capacity region A of the network is defined 
as the set of all flow rates which are supportable given a set of flows and their corresponding routes. In 
the case of fixed routing, A = {A/}^ eJF € A if there exists a p, = {pi} le/ r G co{T) such that 

The traffic in the network can be elastic or inelastic. If the traffic is inelastic, i.e., the flows' rates are 
fixed (and within the capacity region), then the goal is to route/schedule the traffic through the network 
while ensuring that the queues in the network are stable. If the traffic is elastic, then the goal is to 
allocate the network's resources to all flows in some fair manner. More precisely, suppose that each flow 
has a utility function associated with it. The utility function of flow /, denoted by Uj(-), is defined as 
a function of the data rate Xf sent by flow /, and assumed to be concave and nondecreasing. The goal, 
in the case of elastic traffic, is to determine the optimal solution to the following resource allocation 
problem: 

max ^2 Uf(xf) (1) 

S.t. £ € A, 
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where A is the capacity region described above. 

It has been shown that, for inelastic traffic, the back-pressure algorithm is throughput-optimal. Fur- 
thermore, for elastic traffic, a joint congestion control and back-pressure routing/scheduling algorithm 
has been shown to be able to solve the resource allocation problem (fl]). However, as we mentioned in 
Section H the delay performance of such algorithms can be quite poor. In the subsequent sections, we 
describe our architectures and algorithms in detail. 

III. The Shadow Algorithm 

In this section, we consider networks with fixed routing, and propose an architecture to reduce delays 
and reduce the number of queues maintained at each node. The main idea is to use a fictitious queueing 
system called the shadow queueing system to perform flow control and resource allocation in the network 
while using only a single physical FIFO queue for each outgoing link (also known as per-neighbor 
queueing) at each node. The idea of shadow queues was introduced in iflOl . but the main goal there 
was to extend the network utility maximization framework for wireless networks to include multicast 
flows. However, one of the main points of this work is to show that shadow queues can be useful 
even in networks with unicast flows only for the purpose of delay reduction. Further, the idea of using 
per-neighbor queueing and establishing its stability is new here. 

A. Description 

The traditional back-pressure algorithm requires the queue length of every flow that passes through a 
node to perform resource allocation. The main idea of the shadow algorithm is to decouple the storage 
of this information from the queueing data structure required to store packets at each node. The details 
of the shadow algorithm are described as follows. 

Queues and Counters: At each node, instead of keeping a separate queue for each flow as in the back- 
pressure algorithm, a FIFO (first-come first-served) queue is maintained for each outgoing link. This 
FIFO queue stores packets for all flows going through the corresponding link. When a node receives a 
packet, it looks at the packet's header: if the node is not the final destination of that packet, it will send 
the packet to the FIFO queue of the next-hop link; otherwise, it will deliver the packet to the upper layer. 
We let P nm [i] denote the length of the queue maintained at link (n, m) and at the beginning of time slot 
t. 

Each node maintains a separate shadow queue (i.e., a counter) for each flow going through it. Let Qn[t] 
be the length of the shadow queue (i.e., the value of the counter) of flow / at node n at the beginning 
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of time slot t. The shadow queues and real queues are updated according to the scheduling algorithm 
described next. Note that each node still needs to keep a separate shadow queue for every flow going 
through it, but these are just counters, not actual physical queues. A counter is much easier to implement 
than a physical queue. 

Back-pressure scheduling using the shadow queue lengths: At time slot t, 

• Each link looks at the maximum shadow differential backlog of all flows going through that link: 



Back-pressure scheduling: 



w nm [t] = max [Q f n [t] - Q S m [t\) . (2) 



TT*[t] = max 22 Cn m W nm [t]. (3) 
(n,m) 



• A schedule it* = (c^cj, . . . is interpreted by the network as follows: link (n,m) transmits 

(f nm shadow packets from the shadow queue of the flow whose differential backlog achieves the 
maximum in (f2]) (if the shadow queue has fewer than c^ m packets, then it is emptied); link (n, m) 
also transmits as many real packets as shadow packets from its real FIFO queue. Again, if the 
number of real packets in the queue is less than the number of transmitted shadow packets, then all 
the real packets are transmitted. 
We recall that shadow queues are just counters. The action of "transmitting shadow packets" is simply 
the action of updating the counters' values. In other words, "transmitting" k shadow packets from Qn 
to Qm means that we subtract k from Qn and add k to Qm- From the above description, it should be 
clear that the shadow packets can be interpreted as permits which allow a link to transmit. Unlike the 
traditional back-pressure algorithm, the permits are associated with just a link rather than with a link and 
a flow. 

Congestion control at the source: At time slot t, the source of flow / computes the rate at which it 
injects packets into the ingress shadow queue as follows: 

x f [t]= min | U'f 1 (^jf^j , x max | , (4) 

where x max is an upper-bound of the arrival rates, and M is a positive parameter. The source also 
generates real traffic at rate j3xf[t] where (3 is a positive number less than 1. If xj and j3xf are not 
integers, the actual number of shadow and real packets generated can be random variables with these 
expected values. Since the shadow packets are permits that allow real-packet transmission, from basic 
queueing theory, it follows that the actual packet arrival rate must be slightly smaller than the shadow 
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packet arrival rate to ensure the stability of real queues. The parameter (3 is chosen to be less than 1 for 
this purpose. As we will see later in simulations, the queue backlog in the network would be smaller for 
smaller values of j3. 

The above description of the shadow algorithm applies to elastic traffic. For inelastic traffic, the same 
shadow algorithm can be used without congestion control. To ensure stability of the real queues, if the 
real arrival rate of an inelastic flow is Xf, the shadow arrival rate for this flow must be larger than Xf. 
For example, if we wish to make the shadow arrival rate larger than the real arrival rate by a factor 
of (1 + e), it can be accomplished as follows: for every real packet arrival, generate a shadow packet. 
Generate an additional shadow packet for each real packet with probability e. This procedure ensures 
that the shadow arrival rate will be (1 + e) times the real arrival rate. For the algorithm to be stable, the 
set of arrival rates {A/(l + e)}j must lie in the interior of capacity region. 

Alternatively, the shadow algorithm for inelastic traffic can be implemented slightly differently if we 
are willing to tolerate packet loss: fix the shadow arrival rate for each flow and regulate the arrival rate 
of real packets to be a fraction (3 of the shadow arrival rate. For example, if the rate of shadow arrivals 
in a time slot is Xf, then one can inject real packets according to a Poisson distribution of mean /3Xf. 
The real packets could be stored in a queue at its ingress node, and drained at this rate to inject into the 
network. If the mean real arrival rate is larger than f3 times the mean shadow arrival rate, then the real 
packet buffer at the edge of the network will overflow leading to packet loss. Packet loss is unavoidable 
for inelastic flows unless the arrival rate is less than the capacity that the network is willing to allocate 
to the flow. The shadow arrival rate in this case should be thought of as the network-designated capacity 
for a flow. 

We note that the concept of shadow queues here is different from the notion of virtual queues used 
in |[T3l for the Internet and in [HI for wireless networks. In networks with virtual queueing systems, the 
arrival rates to both the real and virtual queues are the same, but the virtual queue is drained at a slower 
rate than the real queue. Instead, here the arrival rates to the real queues are slightly smaller than the 
arrival rates to the corresponding shadow queues. This subtle difference is important in that it allows us 
to use per-neighbor FIFO queues and prove stability in a multihop wireless network in the next section. 

B. Stability of the shadow algorithm 

In this subsection, we establish the optimality and stability of the real and shadow queues. First, we 
note that the optimality of the resource allocation and the stability of shadow queues follow from previous 
results in the literature. In particular, we have the following theorem. 
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Theorem 1: The shadow-queue-based congestion control and scheduling algorithms described in Sec- 
tion InTA] above asymptotically achieve the optimal rate allocation, i.e., 

T-l 

^ t S E ™ =x * + °(VM), (5) 

where x* is the optimal solution to (HJ. Furthermore, the shadow queues are stable in the sense that the 
Markov chain of shadow queues Q[t] is positive recurrent and the steady-state expected values of the 
shadow queue lengths are bounded as follows: 

5>(Q/[oo]) = 0(M). 

n,f 

Proof: The proof of this theorem was presented in (3j , Q, and hence, is omitted here. ■ 
The remaining goal is to prove the stability of the real queues. Note that the sources are sending real 
traffic with smaller rates than shadow traffic, and we know that the shadow queues are stable. However, it 
does not automatically mean that the real queues are stable as well, since each of them is an aggregated 
FIFO queue storing packets for all flows going through its corresponding link. Fortunately, we can apply 
results from the stochastic networks literature to establish the following result. 

Theorem 2: The process describing the joint evolution of both shadow and real queues, 

((«' [t /6 w (iW(1) <"->«)' 

is an irreducible, aperiodic, positive recurrent Markov chain. Therefore, the real FIFO queues are also 
stable. 

The proof is based on the fluid limit approach and a result by Bramson [14]. In his paper, Bramson 
proved that fluid models of Kelly-type FIFO queueing networks are stable as long as the nominal load on 
each server is strictly less than its capacity. Thus, the basic idea of the proof is as follows. The random 
process describing the behavior of shadow queues, under the joint congestion control and scheduling 
algorithm (running on the shadow system), is positive recurrent (as specified in Theorem [[])■ Therefore, 
the average service rate on each network link that the shadow algorithm yields is strictly greater than the 
nominal load of the link due to the thinning of actual traffic; moreover, the (random) cumulative amount 
of service provided on each link up to time t satisfies the functional strong law of large numbers, as t 
goes to infinity. As a result, if we take the fluid limit of the process describing real FIFO queues, it has 
exactly same form as if each network link would have constant, non-time-varying capacity ( equal to the 
average rate provided by the shadow algorithm). Then, this fluid limit is stable by the results of lfl4l . 
which implies stability of the process of real queues. The proof's details are presented in Appendix |A] 
just for the purpose of completeness. 
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Fig. 1. The linear network with N links. 

Note that the real traffic throughput will always be slightly smaller than the optimal solution to (Q]), 
but this difference from the optimal solution can be made arbitrarily small by adjusting the parameter f3. 

IV. Performance Comparison: Back-Pressure Algorithm versus the Shadow 

Algorithm 

In this section, we compare and contrast the performances of the traditional back-pressure algorithm 
and the shadow algorithm for networks with fixed routing. 

A. Elastic traffic under the traditional back-pressure algorithm 

We present simple calculations in this section to get some feel for the performance of the traditional 
back-pressure algorithm when it is used with congestion control. Deriving expressions for the queue 
backlog for general-topology networks seems quite hard, so we confine our discussions to the case of a 
linear network with N links as in Figure [TJ There are N + 1 flows sharing this network: one flow (indexed 
0) goes through all N links, and N other flows (indexed 1 to N) where each of them goes through each 
link. The capacity of each link is c, and for simplicity, we assume that there is no interference between 
links. 

Let Xi and Ui(-) denote the rate and the utility function flow i, respectively. The network utility 
maximization problem we would like to solve is as follows: 



N 



max 




i=0 



s.t. 



^0 < /"0,1 



,N-1 



Xi + Ho,i < c, i = 1, 



N. 



where ^o.i is the resource that link i allocates to serve flow 
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If the utility is logarithmic (proportional fairness), i.e., Ui(x) = log(x), then one can easily compute the 
optimal rates and optimal queue lengths (which are the Lagrange multipliers) for the above optimization 
problem as follows: 

c 

* * * 

x — ^0,1 — • • • — ^0,N — _|_ > 

*_ * _Nc 

X l----- X N-J^ [ , 

N + 1 

Qi = Qo,i ~ Qo,i+i = -j^r , i = l,...,N, (6) 

where q* and q^ i are the optimal queue lengths maintained at node i for flow % and flow 0, respectively. 
Then, the end-to-end total queue length for flow is 

1=1 1=1 

For a more general class of utility functions which model a large class of fairness concepts |[T31 . 

^.1— a 

U l {x) = , a > 0, 

1 — a 

we still have similar results: 

x* = ^,i = • • • = ti,N = © (N- 1 /*) , 

x * 1 = ... = x* N = e(i), 

?i =«o,i-?o,i+l = ©(l)i i = h---,N, (7) 

which again lead to YliLi 1o % = ® (-^ 2 ) ■ ^ s mentioned in the Introduction section, if a fixed rate (larger 
than its arrival rate) is allocated to each flow, then the total queue length in this network is expected to 
increase as the order of N instead of N 2 . 



B. Inelastic traffic under the traditional back-pressure algorithm 

In the previous subsection, we showed that the combined back-pressure and congestion control algo- 
rithm for elastic traffic can lead to quadratic end-to-end queueing delay in terms of the number of hops. 
It is interesting to see whether such a behavior can also be observed in the case of inelastic traffic, i.e., 
the flows' rates are fixed, and the traditional back-pressure algorithm is used. The following theorem 
establishes an upper-bound on the end-to-end queue backlog for any flow. 

Theorem 3: Consider a general topology network accessed by a set of flows with fixed routes. Let 
Kmax be the maximum number of hops in the route of any flow, i.e., K max = maxy Suppose the 
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arrival rate vector A is such that, for some e > 0, (1 + e)A lies in the interior of the capacity region of 
the network. Assume that the arrival processes of the flows are independent of each other, independent 
from time slot to time slot, and have finite second moments. Then, the expected value of the sum of 
queue lengths (in steady-state) along the route of any flow / is bounded as follows: 



E 



nefi(/) 



1 + e b 2 
e \f 



where constant b > depends only on c max . 

Proof: The proof is presented in Appendix |B] ■ 
While the above result is only an upper bound, it suggests the quadratic growth of the total flow queue 
length on the flow route length. The simulation results shown next validate such quadratic growth. 

C. Simulation results for inelastic traffic 

To illustrate the queue length behavior under back-pressure algorithm in the case of inelastic traffic, 
we simulate the linear network in Figure [T] We choose N = 40, i.e., the network has 41 nodes and 40 
links, with no interference between links. Each link has capacity 10, i.e., it can transmit up to 10 packets 
per time slot. Let Ao be the fixed rate of flow 0, and Ai be the fixed rate of flows 1,2,..., 40. We know 
that the back-pressure algorithm will stabilize the network as long as Ao + Ai < 10. We let the sources 
send shadow traffic at fixed rates Aj, and send real traffic at a slightly smaller rate /3Aj, with j3 € (0, 1). 

Figure [2] shows the mean queue lengths of all queues maintained at each node when Ao = 5 and 
Ai = 2.5. The value of j3 here is 0.99. We see that the shadow queue lengths of flow increase nearly 
linearly when going from the end node to the begin node, which leads to a quadratic growth (in terms of 
the number of hops) of the end-to-end queue backlog. Moreover, we also see that the real FIFO queue 
lengths are significantly reduced, even with a small amount thinning of traffic (1%). 

D. Simulation results for elastic traffic 

In this subsection, we investigate the performance of the shadow algorithm with elastic traffic in a 
network with a more complicated topology than a line. In particular, we consider a grid network as shown 
in Figure [3] We assume that all flows have a logarithmic utility function, i.e., Uf(xf) = logXf for all /. 
The network has 16 nodes (represented by circles) and 24 links (represented by dash lines). We assume 
a simple one-hop interference model under which a matching in the graph represents a valid schedule. 
Each link has a capacity of 10, i.e., it can transmit up to 10 packets in one time slot if scheduled. There 
are 48 flows (represented by arrows) sharing this network. 
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Fig. 2. The queue lengths at each node in the linear network in Figure[T] The solid-line boxes are the lengths of shadow queues 
of flow (the long flow) maintained at each node. The dash-line boxes are the shadow queue lengths of flows i, i = 1, . . . , 40, 
(the short flows) at node i, respectively. Finally, the dot-line boxes are the real FIFO queue lengths at each node. 




Fig. 3. A grid network with 16 nodes, 24 links, and 48 flows. Links and flows are represented by dash lines and solid arrows, 
respectively. 
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Fig. 4. The evolutions of total shadow queue length and total real queue lengths with different values of /3 over time. 

We implement the shadow algorithm as described in Section UlI-AI with the parameter M = 1000. In 
Figure 01 we plot the evolution of total shadow queue length and total real queue length for several values 
of parameter (3 (the total queue length is the sum of all queue lengths in the network). Note that the 
shadow queue length is also the queue length of traditional back-pressure scheduling without the shadow 
algorithm. The figure indicates that the total real queue length with the shadow algorithm decreases 
dramatically compared to the traditional back-pressure algorithm. Since the shadow queue length is very 
large compared to the real queue lengths, it is hard to see the actual values of the real queue lengths in 
the figure, so we present some numbers from the simulations here: after a half million time slots, the 
total shadow queue length is around 82 000 while the total real queue lengths are only about 2000, 800, 
and 500, when j3 is 0.99, 0.97, and 0.95, respectively. Thus, significant gains in performance can be 
realized at the expense of a small loss in throughput (represented by the parameter 1 — /3). Note that the 
traditional back-pressure algorithm can perform poorly due to many reasons: (i) As in Section HV-A[ if 
the number of hops for a flow is large, then the queue backlog can increase quadratically. (ii) The choice 
of the parameter M in the congestion control algorithm (see Equation I®) can lead to queue backlogs 
of the order of M (see the upper bound in Theorem Q] and simulation results in (H). (iii) A separate 
queue is maintained for each destination. The shadow algorithm solves all of these problems at once by 
"reserving" capacity between each source-destination pair, i.e., for each flow. 
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V. Min-Resource Routing using Back-Pressure Algorithm 

In this section, we consider wireless networks where each flow's route is not pre-determined, but is 
adaptively chosen by the back-pressure algorithm for each packet. As mentioned in the Introduction 
section, the back-pressure algorithm explores all paths in the network and as a result may choose paths 
which are unnecessarily long and may even contain loops, thus leading to poor performance. We address 
this problem by introducing a cost function which measures the total amount of resources used by all 
the flows in the network. Specifically, we add up traffic load on all links in the network and use this as 
our cost function. In the case of inelastic flows, the goal then is to minimize this cost subject to network 
capacity constraints. Due to the nature of the cost function, in a network with links of equal capacity, 
shorter hop paths will be preferred over longer hop paths. 

In the case of elastic flows, one can maximize the sum of flow utilities minus a weighted function 
of the cost described above, where the weight provides a tradeoff between network utility and resource 
usage. Since the solutions to both problems are similar, we only present the inelastic case here. 



A. Description 

Given a set of packet arrival rates that lie within the capacity region, our goal is to find the routes 
for flows such that as few network resources as possible are used. Thus, we formulate the following 
optimization problem: 

min ^ (8) 

(n,m) 

s.t. X f l {n=b(f)} +^ fi f kn < ^m, V / G F,n E AT, 

(k,n) (n,m) 

{fJ-nm}(n,m)eC S Co(T), 

where \J nm is the rate that link (n,m) allocates to serve flow /, i.e., /i nm = Ylf^nm, and Xf is the 
fixed rate of flow /. An algorithm that asymptotically solves the min-resource routing problem ([8]) is as 
follows. (It is a special case of the algorithm in [3], where the scaling parameter l/M is called /3.) 
Min-resource routing by back-pressure: At time slot t, 

• Each node n maintains a separate queue of packets for each destination d; its length is denoted 
Q„[t]. Each link is assigned a weight 

w nm [t\ = max (JjQftt] ~ ^QtM ~ l) , (9) 
where M > is a parameter (having the same meaning as earlier in this paper). 
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Scheduling/routing rule: 



n*[t] = max V] ir nm w nm [t]. 



(10) 



(n,m) 



• If the schedule n* says, for example, to send c K nm packets over link (re, rre), then link (n, m) transmits 
up to c^ m packets from the queue to Q d m for the destination d achieving the maximum in (|9]). 
Note that the above algorithm does not change if we replace the weights in (© by the following, 
re-scaled ones: 

(11) 



w nm [t] = max Q [t] - Q d m [t] — Ml. 

deD \ 1 

Therefore, compared with the traditional back-pressure scheduling/routing, the only difference is that 

each link weight is equal to the maximum differential backlog minus parameter M. (M = reverts the 

algorithm to traditional.) 

The performance of the stationary process which is "produced" by the algorithm with fixed parameter 
M is within 0(1/M) of the optimal (analogously to ([5])): 

J>* m =0(1/M), 

(n,m) 

where /x* is an optimal solution to ([8]). However, larger M means larger O(M) queues and slower 
convergence to the (nearly optimal) stationary regime. On the other hand, "too small" M results in a 
stationary regime being "too far" from optimal, and queues being large for that reason. Therefore, a good 
value for M for a practical use should be neither too large nor too small. Our simulations confirm these 
intuitions. 



E 






(n,m) 



B. Simulation results 

We ran a simulation for a network with 8 nodes, 10 links, and 2 flows as in Figure [5] We assume 
the node-exclusive spectrum sharing interference model, i.e., each node can only communicate with at 
most one other node in any time slot. Each link has a capacity of 10 when activated. Flow enters the 
network at node 3 and exits at node 4, while flow 1 enters at node 1 and exits at node 6. Note that the 
flows' routes have not been prespecified, and the described above algorithm with parameter M is used. 

We fix each flow's rate at value A. It is easy to see that under the node-exclusive spectrum sharing 
interference model, the back-pressure algorithm can stabilize the network as long as A < 10. The arrival 
processes are Poisson, i.e., the number of arrivals for each flow at each time slot is a Poisson random 
variable with mean A. Each simulation run was 1 million time-slots long and 40 such runs were performed. 
The results reported are averaged over these 40 runs. 
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Flow 

— K!> 



Flow 1 

© © © 



Flow 

©— 



© © 



Flow 1 



Fig. 5. A network with 8 nodes, 10 links, and 2 flows. Each link has a capacity of 10. Flow enters at node 3 and exits at 
node 4, while flow 1 enters at node 1 and exits at node 6. 



Table U shows the rate allocation of each link to each flow when the value of A is fixed at 5.0 and for 
M = 0, 10, and 20. Note that M = corresponds to the traditional back-pressure algorithm. We see that 
the traditional back-pressure algorithm uses all links in the network, while our modified back-pressure 
algorithm (with M = 10 or M = 20) essentially uses only link (3, 4) for flow and link (1, 6) for flow 
1 (which are the min-resource routes for these flows). 

We then turn our attention to the queue backlog (the sum of all queue lengths) in the network. Figure [6] 
shows the queue backlog in the network corresponding to various values of A. We see that the queue 
backlog of our modified back-pressure algorithm with M = 10 is significantly smaller than that of the 
traditional back-pressure algorithm. However, when M is increased to 20, the delay performance gets 
slightly worse. This result confirms our observation about the trade-off in choosing the value of M which 
is discussed at the end of Section IV-AI 



VI. Conclusions 

In this paper, we have proposed a new shadow architecture to improve the delay performance of 
back-pressure scheduling algorithm. The shadow queueing system allows each node to maintain a single 
FIFO queues for each of its outgoing links, instead of keeping a separate queue for each flow in the 
network. This architecture not only reduces the queue backlog (or, equivalently, delay by Little's law) 
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TABLE I 

The link's rate allocation for network in Figure[5]when each flow's rate is A = 5.0. 





M 


= 


M = 


= 10 


M = 


= 20 


Link 


Rate for 


Rate for 


Rate for 


Rate for 


Rate for 


Rate for 


(n, m) 


flow 


flow 1 


flow 


flow 1 


flow 


flow 1 


(0,1) 


1.9492 


1.9671 


0.0000 


0.0000 


0.0000 


0.0000 


(1.2) 


1.9759 


1.5622 


0.0000 


0.0000 


0.0000 


0.0000 


(0,3) 


1.9055 


2.0058 


0.0000 


0.0000 


0.0000 


0.0000 


(1,6) 


0.0563 


2.2417 


0.0000 


4.9998 


0.0000 


5.0001 


(2,4) 


1.4913 


2.4595 


0.0000 


0.0000 


0.0000 


0.0000 


(3,4) 


2.2504 


0.0466 


4.9993 


0.0000 


4.9996 


0.0000 


(3,5) 


1.5551 


1.9881 


0.0000 


0.0000 


0.0000 


0.0000 


(4,7) 


1.2590 


2.0853 


0.0000 


0.0000 


0.0000 


0.0000 


(5,6) 


2.3592 


1.5055 


0.0000 


0.0000 


0.0000 


0.0000 


(6,7) 


2.0094 


1.2535 


0.0000 


0.0000 


0.0000 


0.0000 
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but also reduces the number of actual physical queues that each node has to maintain. Next, we proposed 
an algorithm that forces the back-pressure algorithm to use the minimum amount of network resources 
while still maintaining throughput optimality. This results in better delay performance compared to the 
traditional back-pressure algorithm. 

We presented the shadow algorithm for the case of fixed routing, i.e., the route for each flow is fixed. 
The shadow algorithm can also be used in the case of adaptive routing, but a node cannot use just one 
FIFO queue for each neighbor. If one still maintains a separate queue for each destination at each node, 
then the extension of the shadow algorithm to the case of adaptive routing is straightforward. On the 
other hand, it would be interesting to study if a single per-neighbor FIFO queue can be maintained even 
in the case of adaptive routing. This is an interesting topic for future research. 

Appendix A 
Proof of Theorem [2] 

In this appendix, we provide details of the proof of Theorem [2l First, recall the result from Theorem Q] 
that 

1 T_1 
t=o 

where x*(e) is within e-boundary of the optimal solution x* and e can be made arbitrarily small by 
increasing M. To simplify the notations, from now on, we will drop e in x*(e). In other words, we will 
use the notation x* for the e-approximate optimal solution. 

From the above result, the following convergence results can be established. 

Lemma 1: For every flow / G J 7 , 

1 T_1 

^Zl a /W T -^° P x } m - s -> 

t=o 

i.e., the time average of real packet arrivals also converges to the optimal solution. 
Proof: 

Consider any flow /. We have the sequence of flow rates {x/[0], a?/[l], • • •} and the sequence of 
generated shadow packets {a/[0], a/[l], • • •} . Note that given the sequence of flow rates {^/[i]}^ , 
a/[t]'s are independent Poisson random variables with means (3xf[t]'s. For simplicity, we drop the 
subscript / in the notations within the scope of this proof. 
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Let x[t] := E[x[t]], and a[t] := E[a[t]] = E[(3x[t\] = 0x[t]. We have that 



E 



T \ 

f J>[i]-/Js* 



J^2 



E 



k i=l 



^E[(a[z]-/3x*) 2 ] 



2^2 



8=1 



T T 



+ ^E E - Px*)(a\j] - Px*)} 



Now, for z 7^ j, 



Ccw(a[i], a[ 



= E[(a[i]-a[i])(ab1 - a[ 

= E[E[(a[i]-aH)(ab"]-ab1)l{^W}r=o]] 
= 0, 

i.e., a[i] and a[j] are uncorrelated. The last equality is due to the fact that a[i] and a[j] are independent 
given the sequence {xk}^L - Hence, 

E [(a\i] - Px*)(a[j] - fix*)] = E [(a[i] - a[i])(a[j] - a[j])] + (a[»] - /3x*)(a[j] - (3x*) 

= (a[i\-Px*)(a[j}-Px*). 
Furthermore, E [(a\i\ - (3x*) 2 ] = E [(a\i\ - en) 2 ] + (a[i] - /3x*) 2 . Therefore, 



E 



v 1=1 / 



T T T 

^ E ^(«W) + ^ E £(*h - ^™ - w 



rp2 



1=1 
T 



i=i j=i 

'1x' 



i=i L 

Since Var(a[i]) is finite, the first term in the right-hand side will vanish as T goes to infinity. The second 
term also vanishes as T goes to infinity due to (fl2l ). Therefore, 



lim E 

T->oo 



i E «w - pA 

V 1=1 / 



0. 



Lemma 2: Recall that it* [t] is the outcome of the scheduling algorithm (0) at every time step t. Then, 
for every link I G C, 



T-l 



a.s. 



t=o 
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for some fi* such that A/ < V7 G £. 

In other words, the outcome of the scheduling algorithm converges to a set of link rates that can support 
the given set of flow arrival rates. 

Proof: Since the Markov chain of shadow queues is positive recurrent, the proof follows the ergodic 
theorem and the fact that 7r* is upper-bounded. ■ 
To be consistent with [14], we introduce the concept of packet class. Each flow / consists of \L(f)\ 
packet classes; each class going through one link in the route of /. We let S denote the set of all packet 
classes. In other words, there is a bijection mapping a pair (/,/), / G J 7 , I G L(f), to a packet class 
seS. Clearly, |<S| = 

For each flow / G T, let <&(/) be the set of packet classes belonging to /. For each link I G C, let 
C(l) be the set of packet classes going through I. Conversely, for each packet class sC5, let f(s) be 
the corresponding flow (i.e., s G $(/(s))), and l(s) be the corresponding link. 

Let H denote the constituency matrix with size \C\ x |«S|: 



1 if s G C(0, 
otherwise. 



Also, let R be the routing matrix with size |5| x |«S|: 

Rs,u = 

Next, let E s (t) denote the total external arrivals of packet class s up to time t. Thus, 

i-l 

a f [k] if s is the first hop of f(s), 

k=0 

otherwise. 
Also, we define the arrival rates corresponding to packet classes: 



1 if f(s) = f(u) and u is the next hop of s in the route of /, 
otherwise. 

m 

t-i 

E s (t) 



x*j if s is the first hop of f(s), 
otherwise. 



We then extend the definition of E s (t) for continuous time as follows: for each time t G E s (t) := 
E s ([t\) . Hence, E s (t) is right continuous having left limits in time. 

Recall that ir*[t] is the outcome of the scheduling algorithm at time slot t. Now, for each t G K + , we 
let Mi(t) := Mi(\t\) = Y^k^o^i M denote the total amount of service (in terms of number of packets 
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that can be transmitted) of link I up to time t. Now, for each s € 5, let us define m s (t) := Mu s \(t), 
and define 

M(t) := Mag ( . . . , . (13) 

Similarly, let us define A s (t) = A s ([t\) as the total arrivals, and D s (t) = D s ([t\) as the total 
departures, of packet class s up to time t. Thus, 

A s (t) = E s (t) + £ D u (t)R u , s . (14) 
ue<s 

Let Q s (i) = Q s ([tj) be the number of packets of packet class s which are waiting to be served. Then, 

Qs(t) = Q s (0)+A s (t)-D s (t). (15) 
Recall that P t (t) = Pi is the length of FIFO queue at link I at time t. Thus, 

Pi{t) = Q>® =J2H l , a Q„(t). (16) 

seCQ) s 

Now, we define 

• T s (t) as the amount of time that the server at link l(s) has spent serving packet class s in [0, i]; 

• I;(t) as the amount idle time of the server at link I during [0, t]; 

• Wi(t) as the immediate workload at the server of link I, measured in units of time. 
Then we have the following equations: 

Y J Hi, s T s {t)+I l {t) = t (17) 

s 

Wiit) = j^L J2 H l>s (A s (t) + Q s (0)) - Hi, s T s (t), (18) 

and the fact that Ii(t) can only increase when Wi(t) = 0, i.e., if ^(^2) > Ii(ti) then Wi(t) = for some 
t G [tiM- 

We can rewrite the above equations (fl4l - ([T8T ) in vector form to get the following set of equations 
which describes the evolution of the system: 

A(t) = E(t) + R T D(t) (19) 

Q(t) = Q(0) + A(t) - D{t) (20) 

P(t) = HQ(t) (21) 

HT(t) + 1(f) = et (22) 

W(t) = HM(t) [A(t) + Q(0)] - HT(t) (23) 

Ii(t) can only increase when Wi(t) = 0, I € C, (24) 
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where M(t) is defined in (fT3l and e = [1, 1, . . . , 1] T . Additionally, we have that 

M(t)D(t) < T(t) < M(t)(D(t) + e) (25) 
£ s (t + W, (8) (t)) = Q s (0) + A,(t) (26) 

where Equation (|25T ) comes from the fact that each class has at most one packet being served at any 
time, and Equation (f26b comes from the FIFO property of the real queues. 

Note that E s (t), Mi(t), A s (t), D s (t), Q s (t), and Wi(t) are right continuous having left limits, while 
T s (t) and I^t) are continuous in time. We also assume that A(0) = D(0) = T(0) = 1(0) = 0. 

Let us define 

X(t) := (A(t),D(t),Q(t),W(t),T(t),I(t),Q(t)), 

where Q s (t) = Q s (\t\) is the shadow queue of class s. Then X(t) is a Markov process. Furthermore, 
Q(t) and (Q(t),Q(t)) are themselves Markov processes. By Theorem [Q we know that Q(t) is positive 
recurrent. 

We now describe the fluid model of the system. The set of fluid model equations is as follows: 



A(t) = p\t + R 1 D(t) (27) 

Q(t) = Q{0) + A(t) - D(t) (28) 

P(t) = HQ(t) (29) 

HT(t) + I(t) = et (30) 

W(t) = HM[A(t) + Q(0)] — HT(t) (31) 

Ii(t) can only increase when Wi(t) = 0, I € £ (32) 

T(t) = MD(t) (33) 

D s (t + W l(s) (t)) = Q s (0) + A s (t), (34) 



where M* = diag [ i , -Ar , . . . , — ^ ) , and m* = n* u ,. Recall that n* is defined in Lemma [2] as the 

\ m i m 2 rn \s\ J y s f 

set of supporting link rates. Equation d32l means that for each t > 0, whenever W/(f) > 0, there exists 
5 > such that + (5) = //(t - 5), i.e., is constant in (t - 5, t + (5). 
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A. Preliminaries 

Theorem 4 (see /fi6l/): If random variables Z n and Z satisfy that Z n —> Z, where the notation (— ^) 
denotes the convergence in distribution (weak convergence), and if the Z n are uniformly integrable, i.e., 



lim sup / |Z n |dP = 0, 

n J\z n \>a 

then Z is integrable and 

limE[Z n ] =E[Z\. 

n 

Consider the sequence of scaled processes 

X r (t) = -X(rt), t>0, r = 1, 2, . . . , 
r 

then we have the following theorem: 

Theorem 5 ( / f771/ . /ITffl/).- Suppose that, for any sequence of scaled processes X r (t) satisfying ||(<5 r (0), Q r (0)) \\ 
1, r — > oo, there exist a subsequence — > oo and a constant T > such that 

lim E\\(Q rk (t),Q rk (t))\\ = 0, Vt>T. 

r fc — i>oo 

Then the queueing system is stable, in the sense that the Markov process (Q(t),Q(t)) is positive recurrent. 
Corollary 1: Suppose that there exists a deterministic function 

X(t) = (A(t),D(t),Q(t),W(t),T(t),I(t),Cj(t)) 

such that the following conditions hold: (i) 

1) For any sequence r — s> oo, there exists a subsequence — s> oo such that X Tk (-) -^4 X(-) as 
r k -> oo. 

2) For any X(t) satisfying ||(Q(0), Q(0))|| = 1, there exists a T > such that \\(Q(t), Q(t))\\ = 
0, Vt > T, 

3) (Q r (t),(5 r (t)) is uniformly integrable for all t > 0, 
then the original process (Q(t),Q(t)) is positive recurrent. 

Proof: 

From conditions (i) and (ii), we have that ||(Q(i), Q(t))|| — - > for all i > T as r — > oo. Along with 
condition (iii), Theorem H yields that lim^oo E[||(Q(i), Q(t))\\] = 0, Vt > T. We then apply Theorem[5] 
to get the result. ■ 
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B. Proof's details 

For an integer d > 1, let B d [0, oo) be the set of functions / : [0, oo) — > %t d that are right continuous on 
[0, oo) having the left limits on (0, oo). For t > 0, we use f(t—) to denote lim. s ^ t f(s). By convention, 
/(0-) = /(o). 

Let us endow the function space B d [0, oo) with the Skorohod J\— topology. We now define the 
convergence of a sequence of functions in B d [0, oo) under that topology. Let A denote the set of strictly 
increasing, continuous functions / : 5? + — > 5? + such that /(0) = and limt_ > . 00 f(t) = oo. 

Definition 1: A sequence {f n } C B d [0, oo) is said to converge to / € B d [0,oo) in the Skorohod 
topology if for each t > 0, there exists {A n } C A such that 

lim sup |A n (s) - s\ = 

n ^°°0<s<t 

lim sup \f n (\ n (s))-f(s)\ = 0. 

n ^°°0<s<t 

Next, let us define the convergence under the uniform topology. 

Definition 2: A sequence {f n } C B d [0, oo) is said to converge to / € B d [0, oo) uniformly on compact 
intervals (u.o.c.) as n — > oo, denoted by f n — >■ / u.o.c, if for each t > 0, 

lim sup \f n (x)-f(s)\=Q. 

n ^°°0<s<t 

Note that B d [0, oo) under the Skorohod topology is separable, while B d [0, oo) under the u.o.c. topology 
is not. However, if the limit point / is continuous, the two notions of convergence are equivalent. We 
let C d [0, oo) denote the set of continuous functions / : [0, oo) — > $t d . 

Now, consider the scaled process 

X r {t) = -X(rt), t > 0, r = 1, 2, . . . . (35) 

r 

The processes X(t) and X r (t) take values in B^[0,oo), where K = 8|<S| + 2\C\. 
Lemma 3: For any sequence r — > oo, there exists a subsequence — > oo such that 

X rk (t)^X(t) (36) 

for some X(t) € B A [0, oo). Moreover, X(t) is continuous with probability one. 
Proof: By Lemma [TJ we have that 

lim E(T) = p\T m.s. 

T->oo 

Thus, for each t > 0, -E r (t) converges to j3\t in probability as r — > oo; i.e., given any e > 0, 

lim P(W(i)-/3Ai|| > =0. 

r— s-oo \ 2/ 
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Furthermore, if \\E r (t) - /3Xt\\ < § , then 

\\E r {h)-E r (t 2 )\\ < \\E r (h) -p\tx\\ + \\E r (t 2 ) -p\t 2 \\ +l3X\h -t 2 \ < 0X\h-t 2 \ + e. 
Therefore, E r (t) is "asymptotically Lipchitz"; i.e., for any e, 

P(||£ r (ii) -£ r (£ 2 )|| <pX\ti-t 2 \ + e) -> 1 as r -> oo. 
Also, from Lemma 12 we have that for each I £ £ : 

lim M { (T) = /i?T a.s. 

T— s-oo 

Furthermore, the processes -D(i), Q(t),I(t), T(t), Q(t) have bounded increments. Thus, it is easily to see 
that X r (t) is "asymptotically Lipchitz"; i.e., for any e, there exists L > such that 

P(\\X r (t 2 ) -X r (t!)\\ < L(t 2 -h) + e) -> 1 as r ^ oo. 

This implies the sequence {X r (t)} is relatively compact (ref. Corollary 3.7.4, lTT9l ), i.e., the family of 
their associated probability distributions, denoted by {V r {-)} , is relatively compact. Thus, there exists 
a subsequence of {V r {-)} which converges to some V(-) under the Prohorov metric. This then implies 
that there exists a sub-sequence of {X r (t)} which weakly converges to some X(t) (ref. Theorem 3.3.1, 
lTT9l ). Moreover, it follows from this weak convergence and the asymptotic Lipchitz property that the 
limit X(t) is continuous with probability one (ref. Theorem 3.10.2, |[T9l ). ■ 
We call any X(t) satisfying (l36l) a fluid limit of X(t). Given a fluid limit X(t) and the converging 
subsequence {X rk (t)}, the Skorohod representation theorem implies that there exist some processes 
and {X rfc (t)} in a common probability space such that X(i) and X rfe (t) have the same probability 
distributions as X(t) and X r ' k (t), and that X T ' fc (i) converges to X(i) almost surely under the Skorohod 
topology. Furthermore, since the limit point is continuous with probability one, that convergence is in 
the almost sure sense under the uniform topology, i.e., 

X rk (t) -> X(i) u.o.c. with probability 1. (37) 

Now, let us abuse the notations by denoting 

X(t) = (A(t),D(t),Q(t),W(t),T(t),I(t),Q(t)), 

where the components of X(t) satisfy the set of equations dT9l>-(l26T>. Also, let 

X(t) = (A(t),D(t),Q(t),W(t),T(t),I(t),Q(t)) 
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be a corresponding fluid limit; i.e., the condition (I37T ) holds as — > oo. We will show that the components 
of X(i) satisfy the set of fluid model equations (I27l)-(l34l). as stated in the following lemma. 

Lemma 4: Any fluid limit X(i) in 071 ) satisfies the set of fluid model equations d2Tb-(f34b- Also, the 
component Q(t) of X(i) is zero for all t. 

Proof: First, combining Lemmas [T||3l and (l37l ). we know that 

lim E Tk (t) = f3\t u.o.c. with probability 1, 

r fc — >oo 

lim M[ k (t) = fiit u.o.c. with probability 1. 

r k — >oo 

Then, it is easy to see that X(t) satisfies d27j-(|3D, <l33i and (® since X(i) satisfies (fl9Tl-(l23i (l25i and 
(l26l) . respectively. To prove (l32l ) for X(i), we need to show that for each t > 0, whenever Wi(t) > 0, then 
there exists 6 > such that Ij(t + <5) = Ji(i-£), i.e., Jj(s) is flat in (t-5,t + 8). Suppose that Wi(t) > 
for any t > 0. Since Wj(i) is continuous, there exists a 5 > such that e = min^^ t+( n ^(s) > 0. 
Since X(-) is a fluid limit, there exists a sample path oj such that 



u.o.c. 



(W r *(;u),I r *(;u))^{W(-),I(-)) 

as rfc — >• oo. In particular, there exists an integer iV such that 

inf W[ k (s,co) > e/2 
se(t-5,i+<5) 

for r fe > TV. It means that Wj(s,o;) > for s G (r fe (i - 5), r&(i + <5)) and r fc > iV. Thus, by d24b . Ii(s,u) 
is flat for s G (^(^ — 5): r k{t + ^)) when > TV, or equivalently, lT k (s, co) is flat for s G (t — 5, t + 5). 
Letting — > oo, we have that /;(s) is flat for s G (t — 5, t + <5), and hence we prove d32l ). Finally, from 
the positive recurrence of Q(t), it is easy to see that Q(t) is zero for all t. ■ 
So the final step is to show that any solution to the set of fluid model equations (T27T)- (l34l) is stable. In 
fact, this is true by Bramson's result lfl4ll . It thus completes the proof of Theorem 12 



Appendix B 
Proof of Theorem [3] 

Recall that L(f) is the set of links forming the route of flow /. Now, we let R(f) denote the set of 
nodes forming the route of / (and hence, \R(f)\ = \L(f)\ + 1). For each pair (/, n) such that n G R(f), 
we abuse the notation by letting 

• n + 1 denote the next node of n in the route of / (n ^ e(/)); 

• n — 1 denote the previous node of n in the route of / (n &(/))• 



For each n G R(f), let us define 

7r «t(/,n) M : = Tfn,rH-l)M ' n ^ e (/)' 

n = b(f), 



The queue dynamics are given by 

Qn[* + !] = (OnM ~ ^outtf.n) [*]) + ^m(/,n)M- 

Now, consider the Lyapunov function 

feTneR(f) 

We can rewrite the queues' dynamics d38l ) as follows: 



where 



Ql[t + l] = Ql[t] [t} + [t]+ui[t] 



u f n [t] = < 



if Q J n [t] > 7r au4 ( /)n) [t], 

-Qli M + ^tmt(f,n) W if <2n [*] < ^out(f,n) M • 

The drift of the Lyapunov function is given by 



AV[t] := E[V(Q[t + l])-V(Q[t])\Q[t}] 



E E E 2QJ[[t] ^out(f,n) 



feTneRtf) 

+ 2u£[i]7r inC/> )[t] + (it£[t]) + 2u{[t] - 7W (/in) [t] 

Recall that 7r in ( /)fl )[t] = Tr^/^i) [i] - it£_i[£], n 7^ &(/). Thus, we get 



AF[t] = Bi[t] + J2Q f b(f) [t]X f -J2 E (<3n[*] - E [^ m [i] Q[t] 

/eJ- feJ 7 (n, m )eL(f) 

= B 1 [t) + J2Q f Kf) m- E < m W max 
where the last equality is due to the back-pressure scheduling algorithm, and 

B ^ = ^E E E [hn(/,n)W-^(/,n)M) 2 + (^[t]) 2 - 2u f n _ 1 [t]Q( l [t] 

feTneR(f) 

[t])\Q[t]\. 



2S 



Since A is strictly inside the region A, there exist a positive constant e and a vector of link rates /i 
such that 

Mnm > (1 + e) ^2 x f > and /i € co(r). 

f:(n,m)<=L(f) 

Hence, 

E</)W A / = E E 

/e^ fer (n,m)€L(f) 

< — — V /i„ m m ax (QnW - QmW 

J- + e , f:(n,m)eL(f) \ 

Therefore, 

AV[i] < Silt] - «mM " Mnm) , , max - Q/ [t] 

- T~T~ E ^ , , m f X r ,^ ~ QmM 

1 + e , ^ / _ / , f:(n,m)&L(f) V 

Now, for any flow / € J 7 , we have that 

£ ^ W)l E (<9nW - QmW 

neK(/) (n,m)ei(/) 

< £ max (Q^]-g^[t]) 

(n, m)e L(/) 9:(n ' m)ei{9) 

< — E ^ m max (Qm - om + , 

VLU) , 9-(n,m)£L(g) 

where ^L(f) > is the minimum link rate \i nm of any link which is part of the flow's route; obviously, 
> <\f- Thus, for any flow / € J 7 , 

AV[t] < Bx[t]- e A/ J2 Qnlt}- (39) 

neR(f) 

Note that < 6|J-"|K maa ., Vi, for some constant 6 > which depends only on c max (see model 

definition). Thus, by manipulating d39l) . V/ € J 7 , we obtain 



+ 



1 

lim sup — 2_. IE 



E 

ne-R(/) 



1 + €b 'f\K> 



< — ' \T\K 

— \ I J 1 1 1 max • 

e A / 



The above bound along with the positive recurrence of Q[t] gives the desired result. 
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